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Closed Circuit Tele-Vision surveillance systems are frequently the subject of 
debate. Some parties seek to promote their benefits such as their use in 
criminal investigations and providing a feeling of safety to the public. They 
have also been on the receiving end of bad press when some consider 
intrusiveness has outweighed the benefits. The correct design and use of such 
systems is paramount to ensure a CCTV surveillance system meets the needs 
of the user, provides a tangible benefit and provides safety and security for 
the wider law-abiding public. In focusing on the normative aspects of CCTV, 
the paper raises questions concerning the efficiency of understanding 
contemporary forms of ‘social ordering practices’ primarily in terms of 
technical rationalities while neglecting other, more material and ideological 
processes involved in the construction of social order. In this paper, a 
360-degree view presented on the assessment of the diverse CCTV video 


surveillance systems (VSS) of recent past and present in accordance with 
technology. Further, an attempt been made to compare different VSS with 
their operational strengths and their attacks. Finally, the paper concludes with 
a number of future research directions in the design and implementation of 
VSS. 
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1. INTRODUCTION 

CCTV (Closed Circuit Tele-Vision) is one of the most widely used physical security technologies. 
A surveillance camera is a video collection device installed at a particular location and utilized for a variety 
of purposes. As CCTV performance has become enhanced recently, technology is being developed that 
attempts to perform automated processing through facial recognition using the facial information acquired 
from a CCTV system [1]-[5]. However, if these technologies are exploited maliciously, privacy may be 
seriously violated. A set of communication equipment devices that collect image information from a 
surveillance camera device installed at a particular location, and transmit the images via an opened 
wire/wireless communication channel, so that only specified persons can receive it [6]. 
a. Image monitoring control server 

A server that stores, manages, and monitors the image information received from a surveillance 
camera. The image monitoring server is composed of several modules such as encryption, decryption, facial 
area detection, privacy protected image, image saving, and monitoring. The monitoring module can be 
located behind the en/decryption module or privacy protected image module, depending upon whether 
privacy protection is to be applied or not, while monitoring the image [7], [8]. 
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b. Client 

A system or user that seeks to receive and use the CCTV image from the image monitoring and 
control server. Desktops, laptops, and mobile phones can be an example of clients. The client is composed of 
the en/decryption, facial recognition, and image utilization modules [16]. 

c. Facial Area Detection 

A process that must be executed before recognizing the face, and which detects the image spot 
where the face is located. Generally speaking, “facial area detection” refers to the phase that identifies major 
facial parts such as the face shape, eyes, nose, and mouth, whereas the “characteristics extraction phase” 
refers to pre-processing after facial area detection, as well as facial feature extraction for the face 
area [14], [15]. 

Our goal in this paper is to provide a clear overview of various video survelliance system (VSS) 
suggested by various researchers and provide a guide for further design and development in this area. The 
contributions of our work are listed as follows: 

1. We summaries the various categories of VSSs. 
2. We classify the current state of VSS and outline several major open attacks are possible in VSS. 
3. We provide design goals for the development of future VSS which can be integrated into existing VSSs. 

This paper is organized as follows. Section 2 gives an overview of various VSSs. Section 3 we 
review main attacks taxonomies for video surveillance systems. Section 4 provides we provide a set of 
recommendations that can help improve the design and development, which includes security and privacy 
levels provided by the hardware, the firmware, the network communications and the operation of video 
surveillance systems. Section 5 concludes our work. 


1.1. Overview of surveillance system 
a. CCTV Systems 

With the development of the Internet network, the network based CCTV is now widely used in our 
society. In particular, CCTV is used for crime prevention, and the scope of utilization is gradually expanding. 

Video cameras are either analogue or digital, which means that they work on the basis of sending 
analogue or digital signals to a storage device such as a video tape recorder or desktop computer or laptop 
computer [9], [10]. 
b. Analogue 

Can record straight to a video tape recorder which are able to record analogue signals as pictures. If 
the analogue signals are recorded to tape, then the tape must run at a very slow speed in order to operate 
continuously. This is because in order to allow a three-hour tape to run for 24 hours, it must be set to run on a 
time lapse basis which is usually about four frames a second. In one second, the camera scene can change 
dramatically. A person for example can have walked a distance of 1 meter, and therefore if the distance is 
divided into four parts, i.e. four frames or "snapshots" in time, then each frame invariably looks like a blur, 
unless the subject keeps relatively still [11]-[13]. 
c. Digital 

These cameras do not require a video capture card because they work using a digital signal which 
can be saved directly to a computer. The signal is compressed 5:1, but DVD quality can be achieved with 
more compression (MPEG-2 is standard for DVD-video, and has a higher compression ratio than 5:1, with a 
slightly lower video quality than 5:1 at best, and is adjustable for the amount of space to be taken up versus 
the quality of picture needed or desired). The highest picture quality of DVD is only slightly lower than the 
quality of basic 5:1-compression DV [17]-[19]. 
d. Network 

IP cameras or network cameras are analogue or digital video cameras, plus an embedded video 
server having an IP address, capable of streaming the video (and sometimes, even audio). Because network 
cameras are embedded devices, and do not need to output an analogue signal, resolutions higher than closed- 
circuit television 'CCTV' analogue cameras are possible. A typical analogue CCTV camera has a PAL 
(768x576 pixels) or NTSC (720x480 pixels), whereas network cameras may have VGA (640x480 pixels), 
SVGA (800x600 pixels) or quad- VGA (1280x960 pixels, also referred to as "megapixel") resolutions. 
e. Digital still cameras 

The pixel resolution of the current models has easily reached 7 million pixels (7-mega pixels). Some 
point and shoot models like those produced by Canon or Nikon boast resolutions in excess of 10 million 
pixels. 

At these resolutions, and with high shutter speeds like 1/125th of a second, it is possible to take jpg 
pictures on a continuous or motion detection basis that will capture not only anyone running past the camera 
scene, but even the faces of those driving past. These cameras can be plugged into the USB port of any 


CCTV Surveillance System, Attacks and Design Goals (Muthusenthil B) 


2074 O ISSN: 2088-8708 


computer (most of them now have USB capability) and pictures can be taken of any camera scene. All that is 
necessary is for the camera to be mounted on a wall bracket and pointed in the desired direction [18]. 


CCTV 
surveillance 
system 
Owners / Operators F; \ 


Police & Justice 
System 


Figure 1. CCTV Surveillance System 


As shown in Figure 1, the CCTV system is composed of various wire/wireless surveillance cameras 
connected to an image monitoring control server, as well as the client [19]. The CCTV system transmits and 
receives image data via a wire/wireless communication channel, as is composed of various components, such 
as the surveillance camera, image monitoring control server, authentication and access control server, mobile 
phone, desktop, and laptop. 


2. OVERVIEW OF VIDEO SURVEILLANCE SYSTEM AND CATEGORIES 
In this section, we review relevant literature on CCTV System and various categories of video 
survielliance systems. 


2.1. CCTV system 

CCTV-based surveillance networks are widely used for security in public places. An important 
installation problem is to assign a camera schedule with which the system can choose which camera to record 
its video frames to storage at a certain time. Because of the resource constrained and bandwidth limitation, 
only a subset of raw video frames can be recorded for a camera. For security concerns, it is expected that the 
captured frames from the same camera should have equal temporal distance between any two successive 
ones. If all the distances are not the same, there is jitter. Kuan Jen Lin et al. [17] have developed the 
formulation of the scheduling problems is presented. Furthermore, efficient scheduling algorithm is proposed 
to find feasible schedules given a jitter bound. Experimental results show the efficiency and practicability of 
the proposed algorithms. 

Efficiency and robustness are the two most important issues for multi object tracking algorithms in 
real-time intelligent video surveillance systems. We propose a novel approach to real-time multi object 
tracking in crowds, which is formulated as a maximum a posteriori estimation problem and is approximated 
through an assignment step and a location step. Observing that the occluding object is usually less affected by 
the occluded objects, sequential solutions for the assignment and the location are derived. A novel dominant 
color histogram (DCH) is proposed as an efficient object model. 

The DCH can be regarded as a generalized color histogram, where dominant colors are selected 
based on a given distance measure. Comparing with conventional color histograms, the DCH only requires a 
few color components (31 on average). Liyuan Li et al. [18] have proposed multi object tracking method is 
based on a generalized color histogram a novel DCH that is shown to be robust to color and brightness 
changes. The proposed assignment step includes the estimation of orders of visibility, sequential assignment, 
and exclusion. The proposed location step contains visible order estimation, sequential operations of mean- 
shift location, and exclusion. Our tests on a large number of videos and real CCTV systems showed that the 
proposed method is able to track multiple objects through crowds in a high rate of success. 
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Ming Ying et al. [19] have suggested that the visual design method should be introduced and 
advantages of spatial analysis and information query of GIS and 3D city model should be fully used to aid 
CCTV monitoring system in design and maintenance. 

Pradeep K. Atrey et al. [20] have attempted to solve the problem of dynamically selecting and 
scheduling the four best CCTV views. We adopt a human-centric approach in which the system computes the 
operator’s attention in the CCTV views to automatically determine the importance of events captured by the 
respective cameras. The experiments show that the proposed method helps a human operator in identifying 
important events occurring in the environment. 

Roy Coleman et al. [21] have concerned to chart the establishment and uses of CCTV within the 
location of Liverpool city center. In doing this the paper seeks to contextualize CCTV within contemporary 
‘partnership’ approaches to regeneration which are reshaping the material and discursive form of the city. 
Thus CCTV schemes along with other security initiatives are understood as social ordering strategies 
emanating from within locally powerful networks which are orderly regeneration projects. In focusing on the 
normative aspects of CCTV, the paper raises questions concerning the efficiency of understanding 
contemporary forms of ‘social ordering practices’ primarily in terms of technical rationalities while 
neglecting other, more material and ideological processes involved in the construction of social order. 

Tae Hyung et al. [22] have examined the next generation functions and usages from the experience 
from the disaster and safety management system, especially CCTV, in Gimpo Smartopia, Korea. and used 
new hybrid methodology to rebuild requirements and design concepts in terms of their functions, service 
types, usages gated ratings. Based on the examination results, this paper suggested sketched the architecture 
and assessed implications and considerations for the public application development in the disaster and safety 
management. 

Young-Jin Han et al. [23] have proposed a security framework that protects personal information 
obtained from the detected facial area. We pointed out the security threats within CCTV system, and then 
propose a counter measures. The key point of this framework is processing the mosaic or scrambling 
methods to the facial area, so that the facial information cannot be directly obtained without knowledge of the 
secret key. When the original facial information is needed, such as crime investigation, it can be obtained 
through reverse scrambling. This framework will contribute to protect privacy and develop the biometric- 
based physical security technology area. 

Sergio Saponara et al. [24] have proposed to exploit the on-board closed circuit television (CCTV) 
security system to enable advanced services not only for surveillance, but also for safety, automatic climate 
control, e-ticketing. The new system has minimal hardware and installation cost overheads, since it exploits 
the already installed CCTV cameras. In addition, for each wagon, an embedded acquisition and processing 
node (EAP) is used, composed by a video multiplexer, and by a digital signal processor that implements 
algorithms for advanced services such as: smoke detection, to give an early alarm in case of a fire, or people 
detection for people counting, or fatigue detection for the driver. The information is then transmitted from 
each EAP node to the train information system. The final terminals can be the tablets of the train staff, and/or 
visualization displays in each wagon in case of fire alarms for the passengers. 


2.2. Network based video surveillance system 

Hoang Thanh Nguyen et al. [25] have presented a systematic approach by detailing the design, 
implementation, and evaluation of a large-scale wireless camera network, suitable for a variety of practical 
real-time applications. We take into consideration issues related to hardware, software, control, architecture, 
network connectivity, performance evaluation, and data-processing strategies for the network. We also 
perform multi objective optimization on settings such as video resolution and compression quality to provide 
insight into the performance trade-offs when configuring such a network and present lessons learned in the 
building and daily usage of the network. 


2.3. Security based surveillance system 

Anabham Bhavani et al. [26] have focused on the design and implementation of a low cost smart 
security camera with night vision capability using Raspberry Pi (RPI) with PIR. The system was designed to 
be used inside a warehouse facility. It has human detection and smoke detection capability that can provide 
precaution to potential crimes and potential fire. The credit card size Raspberry Pi (RPI) with A passive 
infrared sensor (PIR sensor) handles the moving body, control algorithms for the alarms and sends captured 
pictures to user’s email via Bluetooth. As part of its alarm system, it will play the E-speech sounds: 
“intruder” when there is detection. The system uses ordinary webcam but its IR filter was removed in order 
to have night vision capability. With help of LDR it will sense whether it is night or day if it is night the led 
will on when it detect intruder. 
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Sonali Hargude et al. [27] have developed the abandoned object detection system. This paper 
describes methodology for Intelligent Surveillance system. Proposed Real time abandoned object detection 
System helps to reduce harms causing due to unattended objects. In this paper, we have covered a detail 
discussion on the various stages of any abandoned object detection technique. As future enhancement we can 
take live video feed through android mobile. As everyone is using Smartphone it will be easier to have quick 
looked at the system though android application. It will provide flexibility to the user or the authorized 
person who can keep watch from a distance. 

Grigore M. Havarneanu et al. [28] have covered at least five relevant issues which significantly 
contribute to the prevention of railway suicide and trespass, and mitigation practice: (1) collating details 
across a wide range of countries of what is happening in terms of prevention, data on incidents and processes 
for investigation and the management of suicide and trespassing incidents, etc.; (2) developing and using 
methodology for the evaluation of extensive sets of measures; (3) providing recommendations for further 
examination of selected preventative measures; (4) looking for additional empirical support for a sample of 
selected measures; and (5) providing a toolbox with guidance materials and best practice examples to help 
IMs and RUs implement measures more effectively tailored to their specific needs. 

S.Naga Jyothi et al. [29] have proposed the real time security surveillance system using IoT. The 
system design uses Motion Detection algorithm written in Python as a default programming environment. 
This significantly decreases the storage usage and save investment cost. The algorithm for Motion Detection 
is being implemented on low processing power chip Raspberry pi 2 and Pi camera, which enables live video 
streaming with detection of moving objects and get alarm when motion is detected and sends photos, videos 
to a cloud server directly using pi camera. When cloud is not available then the data is stored locally on 
raspberry pi and sent when the connection resumes. The camera is mounted on the motor and its movement 
(Left/Right) is controlled through IoT webpage by the user, thus providing user with enhanced view of the 
surroundings. 

Patricia Marie L et al. [30] have presented the design and construction of a system consisted of five 
mobile robots (mobots) and a communications system that will serve as a security surveillance system. This 
is implemented using a microcontroller as the core that enables the mobots to work cooperatively. The 
mobots are free to move within their designated areas and are capable of relaying messages via ZigBee 
communication to a base controller system. The purpose of this system is to have an alternative or even a 
complement to regular CCTV surveillance, especially in buildings with several rooms. This would enhance 
the security as the system utilizes a database to store the information gathered from intruder alerts. 
Furthermore, the communication radios used transmit with low power over a long range. The mobots are 
enabled to relay data via mobot-to-PC and mobot-to-mobot paths up to five hops. The data transmitted allows 
the base controller system to identify its source for intrusion detection. 

Hyowon Lee et al. [31] have presented a system under development based on users interacting with 
detected video objects. We outline the suite of technologies needed to achieve such a system and for each we 
describe where we are in terms of realizing those technologies. We also present a system interface to this 
system, designed with user needs and user tasks in mind. 

Alan J. Lipton et al. [32] have outlined two examples indicating that AVS based on computer vision 
technology is a useful piece of the solution for asset protection, perimeter monitoring, and threat detection. 
The Logan airport example demonstrates that this technology is desirable over other technologies because it 
is passive, relatively inexpensive, operationally effective, and provides real-time, actionable intelligence. 
This technology, however, comes with the caveat that the customer must become educated about its 
underlying technology and its applicability. Many proponents of computer vision technology are advocating 
commercial systems that do not perform adequately in real-world environments - they are subject to poor 
detection rates and high false alarms rates in realistic, unstructured environments. At Objectvideo, we 
strongly recommend that potential customers trial the technology in their own unique environments to 
determine the utility of this technology and its adaptability to environmental pressures. Our example shows 
that the Object video system is, in general, extremely effective as a turnkey system - and in cases with unique 
environmental phenomena, our system is rapidly adaptable to overcome operational concems. 

Sunniva F Meyer et al. [33] have explored various trade-offs between standoff and other values, 
and, when appropriate, proposes possible solutions to such dilemmas. Second, it asks whether employing the 
SFF in the FGC of Norway will help illuminate these ‘troublesome trade-offs’. The analysis has 
demonstrated that standoff creates challenges for other purposes of the FGC, such as functional office spaces 
for all employees, but many of these challenges can be solved by planning intelligently, such as creating an 
external commodity reception. Standoff also creates opportunities for reinforcing social-responsibility 
requirements, such as accessibility for pedestrians and environmental considerations. The current literature 
has mostly focused on negative externalities of security, while this paper demonstrates that security measures 
can have both negative and positive externalities and that planning might alleviate some of the negative ones. 
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The results, furthermore, support Little’s (2004) notion about thinking holistically about protection to create 
robust and effective security, and show that the academic community can assist in such holistic thinking. 

Hae-Min Moon et al. [34] have proposed the human identification method that uses height and 
clothing-color information appropriate for the intelligent video surveillance system based on smartcard. It can 
obtain reliable feature information using smartcard. In this paper, representative colors are extracted by 
applying tree-based color quantization technique to the clothing region and height is extracted from the 
geometrical information of the images. Identification is accomplished by comparing the similarities between 
two data based on Euclidean distance. From the experiment, we could see that the identification of a human 
can be checked through the proposed system. 

Robin Singh Sidhu et al. [35] have proposed information processing techniques for CCTV based 
surveillance systems employed in (a) work environments and (b) public places and transport, for automated 
identification of scenes of inter-personal crime. Although both the scenarios presented in this work employ 
similar signal processing and learning algorithms, the objective involved are significantly different. In (a) we 
aim to preserve confidentiality and privacy of official meetings and discussions, while ensuring detection of 
unbecoming behavior, like: bullying, harassment and assault. In the proposed method we identify such 
critical conditions using a combination of image and speech processing and ensue conditional video 
recording and saving. In (b), the target is to identify the occurrence of interpersonal crime using video and 
voice processing, in order to raise alert at the local surveillance station, which may be receiving numerous 
CCTV videos from neighboring areas. This can be an assistance to the security personnel, responsible to 
monitor large number of screens. The proposed methods can be useful curbing interpersonal violence, and 
crime against women, in the form of eve teasing, and harassment. 

Analytical surveillance can perform the surveillance tasks much more efficient comparing to 
operator manual monitoring. This had made it getting increased market’s interest in recent years. Commonly, 
closed circuit television (CCTV) is used for security surveillance. However, CCTVs are purely vision output. 
These silent videos may not provide complete picture of the happening. Sound detection is incorporate into 
vision surveillance for enhancement. Sound detection is able to detect abnormal sound although happen at 
camera blind spots or due to intentional blocking. 

Tan Teng Teng et al. [36] have proposed to use microcontroller embedded system to enhance 
current CCTV system. Proposed abnormal sound embedded system is to carry out the sound detection, audio 
processing and analysis. This study is using only single microphone for sound detection. Audio amplitude 
and frequency range are targeted feature extracted from Fast Fourier Transform (FFT). Abnormal sound of 
human screaming and glass breaking were classified using decision tree. From experiment, proposed 
abnormal sound analytical surveillance system test yield average of 88% accuracy detection. We can 
consider our work is simple and cost effective for field implementation. 


2.4. Video based surveillance system 

Shaokang Chen et al. [37] have reviewed state-of-the-art face recognition techniques for still images 
and video sequences. Most of these existing approaches need well-aligned face images and only perform 
either still image face recognition or video-to video match. They are not suitable for face recognition under 
surveillance scenarios because of the following reasons: limitation in the number (around ten) of face images 
extracted from each video due to the large variation in pose and lighting change; no guarantee of the face 
image alignment resulted from the poor video quality, constraints in the resource for calculation influenced 
by the real time processing. We then proposed a local facial feature-based framework for still image and 
video-based face recognition under surveillance conditions. This framework is generic to be capable of still- 
to-still, still-to-video and video-to video matching in real-time. Evaluation of this approach is done for still 
image and video based face recognition on LFW image dataset and MOBIO video dataset. 


2.5. Video visualization system 

Video visualization (VV) is considered to be an essential part of multimedia visual analytics. Many 
challenges have arisen from the enormous video content of cameras which can be solved with the help of 
data analytics and hence gaining importance. However, the rapid advancement of digital technologies has 
resulted in an explosion of video data, which stimulates the needs for creating computer graphics and 
visualization from videos. Particularly, in the paradigm of smart cities, video surveillance as a widely applied 
technology can generate huge amount of videos from 24/7 surveillance. Fozia Mehboob et al. [38] have 
proposed a state of the art algorithm has been proposed for 3D conversion from traffic video content to 
Google Map. Time-stamped glyph-based visualization is used effectively in outdoor surveillance videos and 
can be used for event-aware detection. This form of traffic visualization can potentially reduce the data 
complexity, having holistic view from larger collection of videos. The efficacy of the proposed scheme has 
been shown by acquiring several unprocessed surveillance videos and by testing our algorithm on them 
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without their pertaining field conditions. Experimental results show that the proposed visualization technique 
produces promising results and found effective in conveying meaningful information while alleviating the 
need of searching exhaustively colossal amount of video data. 

Huan-Ting Chen et al. [39] have developed a robust visualization approach for evaluating the 
coverage of CCTV systems in public building spaces. Firstly, a method for modeling CCTV systems in 
virtual building spaces is presented. The emphasis is placed on offering a visual representation of the CCTV 
coverage in a BIM-based virtual environment. By simulating varifocal lenses and configuring the parameters 
of Revit cameras, the developed approach simulates the CCTV screen views to provide a better visual 
demonstration of the working of the CCTV systems. This is advantageous in the checking of design conflicts 
and effective communication between owners and contractors. The filled regions displayed in the 3D 
environment are also apparent, allowing accurate visual evaluation of CCTV coverage. Finally, in the case 
study of an MRT station, the developed approach is shown to be effective and can be widely applied to other 
building spaces under similar conditions. Table 1 shows the comparison of various methods proposed in the 


security surveillance system. 


Table 1. Comparison of Various Methods Proposed in the Security Surveillance System 


Author Methods Merits Demerits 
Suzhi Bi et Graph-based Cyber Security The graphical characterization of state Solving the complex cyber 
al. [42] Analysis of State Estimation in estimation security provides intuitive security problems are yet to be 
Smart Power Grid. visualization of some complex problem considered. 
structures and enables efficient graphical 
solution algorithms, which are useful for both 
defending and attacking the ICT system of the 
smart grid. 
Jianwei & Reconfigurability Based The proposed scheme meets the security Performance need to be 
Chen et al. Security Service Path requirements and greatly improves the improved in-depth analysis of 
[43] Construction Scheme efficiency of network resources. SAR, determination of network 
trust values, the scheme 
scalability over larger scale 
networks. 
Emanuele Probabilistic Risk-Based The added value of the proposed approach with It gives hidden failures, 
Ciapessoniet Security Assessment of Power respect to conventional security analyses in operators’ delays and delayed 
al. [44] Systems Considering dealing with uncertainty of threats, protection, intervention on 


Incumbent Threats and 
Uncertainties 

Privacy, Security, and 
Reliability for Gesture-Based 


Lucas Silva 
Figueiredo et 


al. [45] Programming 

M. Shamim Toward End-to-End Biomet 

Hossain et al. rics-Based Security for IoT 

[46] Infrastructure. 

Tian-en Distributed Computing 

Huang et al. Platform Supporting Power 

[47] System Security Knowledge 
Discovery Based on Online 
Simulation. 

Mahdi Jamei Micro Synchrophasor-Based 

et al. [48] Intrusion Detection in 


Automated Distribution 
Systems. 


vulnerabilities, and system response. 


Gives better programming and reasoning about 
gesture safety, security, and privacy. 


Sensors or smartphones capture a face image 
and securely transmit it to the IoT platform to 
provide. 


Improves computing efficiency and perform 
better than a centralized platform 


It’s robust, due to its distributed nature; it can be 
used both to verify existing cyber security 
systems and to detect potential cyber attacks and 
it can be inexpensively deployed at existing 
utilities. 


power system. 


Need improvement for 
automatically inferring prepose 
programs by demonstration. 
Fusing can be done in 
multimodal non-invasive 
biometrics in real time to secure 
IoT industries. 

Online simulation-based power 
system security knowledge 
discovery process shows low 
performance. 


More time complexity. 


3. ATTACKS ON VIDEO SURVEILLANCE SYSTEM 


3.1. Visual Layer Attacks 
a. Stage one : 


Malicious firmware update over USB port (or) Remotely via a command injection (or) 


Malicious firmware update over a web interface (or) the VSS or CCTV system could be sold through 
legitimate sales channel with the malware already pre-installed 
b. Stage two: Malicious component is triggered and controlled via a malicious imagery inputs (when such 
imagery is visualized by the cameras and the video sensors). 
c. Hardware based backdoors (attack is implemented in hardware) 
d. In order to guarantee both secure and privacy preserving VSS, a strong light weight hybrid cryptosystem 
(based on multiparty key-sharing scheme) should be deployed in order to prevent privacy and visual layer 


attacks by malware inside the VSS. 
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3.2. Covert channel attacks 

a. Manipulating LEDs/ IR LEDs from software/firmware (CCTV and VSS usually have plenty of LEDs 
both on core equipment as well as CCTV cameras installed outside). 

b. Attacker could send command and control data to the CCTV cameras via the IRLED;s messaging, 

c. Guri et al. [40] presented VisisSploit, a new type of optical covert channel that leak data through a 
standard computer LCD display 

d. Tainting of video frames (process could detect suspicious code which tries to process the video frames) is 
a solution in order to detect such attacks. 

e. Another solution to detect such attacks could be the use of performance counters 


3.3. Steganography attacks 

a. Attacker need to capture the digital image snapshots from well-known URLs and recover the infiltrated 
data. 

b. Automatic methods for steganography detection is needed. 


3.4. Pan-tilt-zoom attacks 

a. PTZ is feature of CCTV cameras that allow them to move in any direction in 3D which is generally 
controlled by PTZ data protocols. 

b. PTZ commands can be sent to PTZ capable cameras from specialized PTZ controls or from software. 


3.5. Audio layer attacks 
Compromised VSS component can use the audio layer as a command and control channel using 
hidden voice command techniques [41]. 


3.6. Denial of service and jamming attack 

Uninterrupted and untampered operation is critically important for VSS and producing a DoS attack 
on CCTV systems even for 1 minute could make them miss an important event. 
Table 2 shows the comparison of various attacks and the solution in video surveillance system. 


Table 2 Comparison of various security attacks and the solutions in video surveillance system 
Name caused by attac olutions Develop 


Attack Type 
Mis yer Costin [49] present st such an on 7 cameras as solution to such was devi y J. 

Attacks the visual layer backdoors. Newsome ef al. [51] of video frames. 
Mowery ef al. [50] implemented a full body scanner as the Another method was developed by Castiglione ef 
secret knock image. al [52] using VSS, a cryptographically-strong 

system could be used similar to the one. 

Covert Though sometimes the LEDs are physically linked to the Guriefal [40] presented VisiSploit, a new type of 

Channels hardware and cannot be controlled from softwarefirmware, optical covert channel that exploits the limitations 

Attacks recent attacks show that manipulatmg LEDs from of human visual perception in order to 
software firmware becomes increasingly practical and feasible unobtrusively leak data through a standard 
[40]. computer LCD display. 


The attacker could then send command and control data to the 

CCTV cameras via the IR LEDs messaging (instead of coded 

visual mages). Such a channel would constitute an addition to 

the classical (W)LAN and Intemet channels used for 

communication and compromise [51]. 
Denial-of- Producing a DoS attack on a CCTV systems even for 1 minute DoS attacks on video surveillance systems have 
Service and could make them miss an important event such as an extremely critical impact and have to be taken mto 
Jamming fast bank robbery [53], consideration durmg design, evaluation and testmg 


Attacks [53]. 


4. DESIGN GOALS ON VIDEO SURVEILLANCE SYSTEM 

4.1. Design requirements 

a. Should be secure but low cost in implementation 

b. Should provide end-to-end system security throughout the entire distribution chain 


CCTV Surveillance System, Attacks and Design Goals (Muthusenthil B) 


2080 O ISSN: 2088-8708 


c. Should sustain current and new heterogeneous environment to attract more applications and more 
customers 

d. Should be scalable from distributed caches and storage device to heterogeneous client devices 

e. Should be extendable from PCs to mobile devices and still remain secure, for flexible new business 
models 

f. Should be easily renewable 

g. Should not reduce the playback quality of the streaming media, i.e., it should not impact continuous 
playback, loss resilient capability, and scalability of the system in real time streaming applications 

h. Should be able to preserve entertainment like experience — users should be able to fast-forward or rewind 
content without any degradation on the viewing or playback experience 


4.2. Design should focus on 

a. Factory reset button-to reset the system to a known factory safe and secure image and state from non- 
volatile , non-writable secure memory chip 

b. Secure scan chains-Implementing secure scan techniques which allows secure debugging, testing, 
restoring without the risk of unauthorized users to gain access and debug. 

c. Remote Software-Implementing remote software technique to ensure the security is not tampered. 

d. Formal proof and verification-Applying some formal proof in order to ensure the hardware design, 
firmware implementation, communication and security protocols. 

e. Standard compliance-Implementing software and hardware security compliance standards. 

f. Artificial Intelligence-Implementing image-tracking facility which must have the features for identifying 
tail-gating, vehicle detection features, unattended baggage identification, queuing analysis, and external 
text insertion feature and intruder detection. 

g. Privacy-Implementing ways that protect the privacy of the individuals including privacy enhancing 
technologies. 


5. CONCLUSION 

Detecting the persons and analyzing their behavior by means of visualization is a prime factor for 
the computer systems nowadays to interact cleverly in a few human populated areas. Visual surveillance 
systems are used for the real time surveillance of targets like persons or vehicles will lead to the description 
of objects’ activities in that environment. Visual surveillance has been used for security observation, anomaly 
recognition, and interloper detection, computing traffic flow, mishap detection on the highways and 
scheduled maintenance. We have provided a detail survey on video surveillance system, attacks and their 
design goals for implementing CCTV video surveillance system. Therefore, there is need of efficient CCTV 
surveillance with higher performance rate and less computation cost. We hope the discussion made in this 
paper will provide a valuable knowledge as well as promote further research and widen the scope of this field 
beyond its current boundaries. 
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